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Abstract 

Cheeger's fundamental inequality states that any edge-weighted graph has a vertex subset S such 
that its expansion (a.k.a. conductance) is bounded as follows: 

def W{S, S) ^ o A- 

mm{w{S),w{S)} 

where w is the total edge weight of a subset or a cut and A2 is the second smallest eigenvalue of the 
normalized Laplacian of the graph. Here we prove the following natural generalization: for any integer 
k € [n], there exist ck disjoint subsets Si, ... , Sck, such that 

max. cj){Si) ^ C-\/Xk log k 

i 

where Xi is the i*^ smallest eigenvalue of the normalized Laplacian and c < 1, C > are suitable absolute 
constants. Our proof is via a polynomial-time algorithm to find such subsets, consisting of a spectral 
projection and a randomized rounding. As a consequence, we get the same upper bound for the small 
set expansion problem, namely for any k, there is a subset S whose weight is at most a 0{l/k) fraction 
of the total weight and 4>{S) ^ C^\k logfc. Both results are the best possible up to constant factors. 

The underlying algorithmic problem, namely finding k subsets such that the maximum expansion is 
minimized, besides extending sparse cuts to more than one subset, appears to be a natural clustering 
problem in its own right. 
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1 Introduction 



Given an edge- weighted graph G — {V, E), a fundamental problem is to find a subset S of vertices such that 
the total weight of edges leaving it is as small as possible compared to its size. This latter quantity, called 
expansion or conductance is defined as: 

(j)G{S) w{S,S) 

min{K;(5), ^(5^)} 

where by wiS) we denote the total weight of edges incident to vertices in S and w{S, T) is the total weight 
of edges between vertex subsets S and T. The expansion of the graph G is defined as 

(j)G =^ min (jjaiS). 

Finding the optimal subset that minimizes expansion (pQ^S) is known as the sparsest cut problem. The 
expansion of a graph and the problem of approximating it (sparsest cut problem) have been highly influential 
in the study of algorithms and complexity, and have exhibited deep connections to many other areas of 
mathematics. In particular, motivated by its applications and the NP-hardness of the problem, the study of 
approximation algorithms for sparsest cut has been a very fruitful area of research. 

In this line, the fundamental Cheeger's inequality (shown for graphs in |Alo86|, IAM85| ) establishes a 
bound on expansion via the spectrum of the graph. 

Theorem 1.1 (Cheeger's Inequality ( |Alo86[ FAMSS] )). For any graph G, 

A2 



2 s^^Z-gs^SVAz 

where A2 is the second smallest eigenvalue of the normalized Laplacian ^ of G . 

The proof of Cheeger's inequality is algorithmic, using the eigenvector corresponding to the second 
smallest eigenvalue. This theorem and its many (minor) variants have played a major role in the design of 
algorithms as well as in understanding the limits of computation. 

It has remained open to extend the sparsest cut problem to more than one subset and recover a suitable 
generalization of the above theorem. A natural way to do this is to consider a partition of the graph and 
some measure of the edges across parts. We survey such extensions in Section |1.1[ Spectral partitioning 
algorithms are widely used in practice for their efficiency and the high quality of solutions that they often 
provide f |BS94llHL95| ). An additional motivation for this work is from large unstructured data, where sparse 
cuts allow a large graph to be analyzed after applying the cuts. Here we focus on the following k sparse cuts 
problem: 

A:-sparse-cuts; given an edge weighted graph G — {V,E) and an integer 1 ^ k ^ n, find k disjoint subsets 
Si, . . . , Sk of V such that max; (jjciSi) is minimized. 

We note that sparsest cut corresponds to the case A: = 2 as both sides of the cut are sparse. Notice that 
the sets Si, . . . , Sk need not form a partition, i.e., there could be vertices that do not belong to any of the 
sets. The problem is a good way to model the existence of several well-formed clusters in a graph without 
requiring a partition, i.e., there could be vertices that do not participate in any cluster. A natural question 
is whether the problem is connected to higher eigenvalues of the graph. Infact it is not hard to show that 
the fc*'' smallest eigenvalue of the normalized Laplacian of the graph gives a lower bound to the fc-sparse 
cuts problem. 



^See Section 1.2 for the definition of the normalized Laplacian of a graph. 
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Proposition 1.2. For any edge-weighted graph G = {V,E), for any integer 1 ^ fc ^ \V\, and Jor any k 
disjoint subsets Si, . . . , Sk C V 

ma.x(t)G{Si) > 

i Z 

where Ai, . . . , X\v\ o-fe the eigenvalues of the normalized Laplacian of G. 

Our main result is an upper bound in terms of Afe analogous to Cheeger's inequality. 

Theorem 1.3. For any edge-weighted graph G = (V, E). and any integer 1 ^ fc ^ \V\, there exist ck disjoint 
subsets Si, ... , Sck of vertices such that 



max 



where Ai, . . . , A|y| are the eigenvalues of the normalized Laplacian of G and c < 1,G are absolute constants. 
Moreover, these sets can be identified in polynomial time. 

The proof of this theorem is algorithmic and is based on spectral projection. Starting with the embedding 
given by the top k eigenvectors of the (normalized) Laplacian of the graph, a simple randomized rounding 
procedure is used to produce k vectors having disjoint support, and then a Cheeger cut is obtained from 
each of these vectors. 

In general, one can not prove an upper bound better than 0{\/Xk log/c) for fc-sparse cuts. This bound 
is matched by the family of Gaussian graphs. For a constant e € (—1, 1), let Nk.e denote the infinite graph 
over M.^ where the weight of an edge {x, y) is the probability that two standard Gaussian random vectors 
X,Y with correlation ^ e equal x and y respectively. The first k eigenvalues of the Laplacian of N^^e are at 
most £ QRSTlObp . The following Lemma bounds the expansion of small sets in Nk^g. 

Lemma 1.4 ( |Bor85( IRSTlOb] ) . For any set S C M*^ with Gaussian probability measure at most 1/k, 

^N,AS);?ni^eiogk). 

Therefore, for any k disjoint subsets Si, . . . ,Sk of Nk^e j niaxi (f>N^ ^ (Si) > fl{\/Xk log k). 

As an immediate consequence of Theorem |1.3[ we get the following optimal bound on the small-set 
expansion problem f [RS10 , R ST10a| ). This problem arose in the context of understanding the Unique 
Games Conjecture and has a close connection to it f |RS10[ [ABSIOI V 

Corollary 1.5. For any edge-weighted graph G — {V,E) and any integer 1 ^ fc ^ \V\, there is a .subset S 
with w{S) = 0{l/k)w{V) and 0g(S') ^ G\/Xk log k for an absolute constant G. 



Theorem 



1.3 also implies an upper bound of 0{k^/Xk log fc) on max^ (t>{Si) for the case when the ck sets 



are required to form a partition of the vertex set. 

Corollary 1.6. For any edge-weighted graph G — {V, E) and any integer 1 ^ fc ^ \V\, there exists a partition 
of the vertex set V into ck parts Si, ... , Sck such that 



:cj)iS,) ^Oik^/Xjogk) 



for an absolute constant c. 



Complementing the above bound, we show that for a k- partition Si, S2, . • • , Sk, the quantity max^ ipc^Si) 
cannot be bounded by ©(-y/A/TpolylogA:) in general. We view this as further evidence suggesting that the k- 
sparse-cuts problem is the right generalization of sparsest cut. 

Theorem 1.7. There exists a family of graphs .such that for any k-partition {Si, . . . , Sk) of the vertex set 

ma.x(j)G{Si) ^ G min | -^,n<^ \ \fXk. 



e correlated Gaussiaiis can be constructed as follows : X ~ M(0, 1)'' and F ~ (1 - e)X + \/2e -e^Z where Z ~ Ar(0, 1)* 
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1.1 Related work 



The classic sparsest cut problem has been extensively studied, and is known to be intimately connected 
to metric geometry |LLR95[ IAR98| . The lower and upper bounds on the sparsest cut given by Cheeger's 
inequality yields a ©(VOPT) approximation algorithm for the sparsest cut problem. There are other known 
algorithms which have multiplicative approximation factors of O(logn) via an LP relaxation [LR99[ lARQSj 
and 0{y/\og n) via a semi-definite relaxation jARV04j ) . In many contexts, and in practice, the eigenvector 
approach is often preferred in spite of a higher worst-case approximation factor. 

The small set expansion problem - a generalization of sparsest cut has received much attention owing to 
its connection to the unique games conjecture. Here the goal is to find a subset of minimum sparsity that 
contains at most a 0{l/k) fraction of the vertices. In terms of eigenvalues, this quantity was shown to be 
upper bounded by ^(VA fcslogfc) in [LRTVllj . and by ©(^Afcioo log^ n) in [ABSlOj . Using a semidefinite 
programming relaxation, [RSTlOa] obtain an algorithm that outputs a small set with expansion at most 
-/OPT log k where OPT is the sparsity of the optimal set of size at most 0{l/k). Bansal et.al. |BFK+ll] 
obtained an 0(-\/lognlog k) approximation algorithm for the small set expansion problem using a semidefinite 
programming relaxation. 

A second generalization that has been studied is that of partitioning the vertex set of a graph into k parts 
so as to minimize the sparsity of the partition (defined as the ratio of the weight of edges between parts to 
the total weight of edges incident to the smallest k — 1 parts) . |LRTVll| give a polynomial time algorithm for 
finding a fc-partition with sparsity at most S-v/A^logfe. Their proof is based on a simple recursive algorithm. 
Closer to this is the (a, e)-clustering problem that asks for a partition where each part has conductance at 
least a and the total weight of edges removed is minimized. |KVV04| give a recursive algorithm to obtain 
a bi-criteria approximation to the (a, e)-clustering problem. Indeed recursive algorithms are one of most 
commonly used techniques in practice for graph multi-partitioning. However, we show that partitioning a 
graph into k pieces using a simple recursive algorithm can yield as many fc(l — o(l)) sets with expansion 
much larger than \/AfcPolylogA: (See Appendix |A[) . 

|GT11) studied a close variant of the problem we consider, and show that every graph G has a k partition 
such that each part has expansion at most 0{k^y/Xk). Other generalizations of the sparsest cut problem 
have been considered for special classes of graphs f |BLR101 iKe lOG. ST96 ). 

A randomized rounding step similar to the one in our algorithm was used previously in the context of 
rounding semidefinite programs for unique games ( |CMM06j ). 



1.2 Notation 

For a graph G = {V,E), we let A be its (weighted) adjacency matrix and di be the (weighted) degree of 
vertex i. We use D to denote the diagonal matrix with Da = di. The normalized Laplacian of a graph 
defined as 

Cg = d-'^d-a)d--^ 

We let = Ai ^ A2 ^ ...An denote the eigenvalues of Cq and v'i,V2, . . . ,v'^ denote the corresponding 
eigenvectors. Let Vi =^ D~^v[ for each i e [n]. Therefore, v'^ Cgv[ = X]u~t«(^«(") ~ Since Vi ^ j 



1.3 Organization 

We begin with our main algorithm used in the proof of Theorem |1.3| in Section [2j In Section |3.1| we give 
an overview of the proof of Theorem |1.3[ We present some preliminaries needed for our proof in Section |3.2| 
and in Section [STS] we present the proof of Theorem 1.3 In Section |4] we present the proof of Proposition 
1.2 and finally in Section [5] we present the proof of Theorem L7 
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2 Algorithm 

Our algorithm for finding Q{k) sparse cuts appears in Figure 



Input : Graph G = {V,E), parameter k. 

1. Spectral projection. Let V — [vi, ... ,Vk] he an nxk matrix whose columns are the top k eigenvectors 
of the normalized Laplacian Cq', let ui, . . . , m„ be the rows of V . 

2. Randomized rounding. Pick k independent Gaussian vectors 51,32, ■ ■ ■ ,gk ^ A/'(0, 1)''. Construct 
vectors hi, h2, . . . , hk G M" as follows: 




{uj,g^) if i = argmax,g[fe]{(u g,)} 
otherwise. 



3. Cheeger cuts. For j — 1, . . . , fc, sort the cooordinates of hj according to their magnitude, and pick 
the subset of lowest sparsity in this linear ordering. 

4. Output all subsets of expansion smaller than 0{^/Xk log k). 



Figure 1: The Many-sparse-cuts Algorithm 



3 Analysis 
3.1 Outline 

Notice that the vectors /ii, /12, • ■ • , /ifc have disjoint support since for each coordinate j, exactly one of the 
{uj,gi) is maximum. Therefore, the Cheeger cuts obtained by the vectors hi yield k disjoint sets. It is 
sufficient to show that a constant fraction of the sets so produced have small expansion. 

As first attempt to proving the upper bound in Theorem |1.3[ one could first try to bound the Rayleigh 
quotient of the vectors {hi} by ©(Afelogfc) (say for a constant fraction of vectors hi). This would imply 
that the corresponding sets would have value 0{y/Xk log fc) by following the proof of Checger's inequality. 
Unfortunately, we note that the Rayleigh quotient of the vectors obtained could themselves be as high as 
il{\/Xk log fc), and using the proof of Checger's inequality this would at best yield a bound of 0{{Xk log fc)i) 
on the expansion of the sets obtained. Therefore, in our proof we directly analyze the quality of the Cheeger 
cuts finally output by the algorithm. 

We will show that for each z € {1, . . . , fc}, the vector hi has a constant probability of yielding a cut with 
small expansion. The outline of the proof is as follows. Let / denote the vector hi. The choice of the index 
1 is arbitrary and the same analysis is applicable to all other indices i G [m]. 

The quality of the Cheeger cut obtained from / can be upper bounded using the following standard 
Lemma used in most known algorithms for sparsest cut. A proof of this Lemma can be found in [Chu97| . 

Lemma 3.1. Let x £ M" be a vector such that ^ 6. Then one of the level sets, say S, of the 

vector X has (j^ciS) ^ 2S. 

Applying Lemma |3.1[ the expansion of the set retrieved from f = hi is upper bounded by, 

J2(i,j)eE \ fi ^ fj\ 
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Both the numerator and denominator are random variables depending on the choice of random Gaussians 
gi,. . . ,gk. It is a fairly straightforward calculation to bound the expected value of the denominator. 



Lemma 3.2. 



21ogfc^E Y^diff ^41og/e. 



Bounding the expected value of the numerator is more subtle. We show the following bound on the 
expected value of the numerator. 



Notice that the ratio of their expected values is 0{yyXk logfc), as intended. To control the ratio of the 
two quantities, the numerator is to be bounded from above, and the denominator is to be bounded from 
below. A simple Markov inequality can be used to upper bound the probability that the numerator is much 
larger than its expectation. To control the denominator, we bound its variance. Specifically, we will show 
the following bound on the variance of the denominator. 



The above moment bounds are sufficient to conclude that with constant probability, the ratio — '''^^^^d /^ — ' 
is within a constant factor of 0(-\/Afelogfc). Therefore, with constant probability over the choice of the Gaus- 
sians gi,. . . ,gk, il{k) of the vectors hi,. . .hk yield sets of expansion ©(V^fc log A:). 

3.2 Preliminaries 

We collect here some useful properties of the spectral embedding, the randomized rounding, and a well-known 
analysis of Cheeger cuts. 

Spectral Embedding 

Lemma 3.5. (Spectral embedding) 



Lemma 3.3. 



E < 8(2ci + l)(^A^log? fc). 



Lemma 3.4. 



Var ^diff ^321og^fc. 



1. 



2. 



Y.d4u,r = k 



3. 




Proof of (3). 



i,j i,j \t=l / 



= ^ didj ^ Ui{t-i)uj{ti)ui{h)uj{t2) 

i-j tiM 
= ^^didjUi{ti)Uj{ti)Ui{t2)Uj{t2) 



ti,t2 \ i / 

Since \/diUi{ti) is the entry to corresponding to vertex i in the t*l^ eigenvector, diUi{ti)ui{t2) is equal 
to the inner product of the tf^ and eigenvectors of C, which is equal to 1 only when ti = t2 and is equal 
to otherwise. Therefore, 

^ didj{ui,Uj)'^ = '^I[ti=t2]=k 

i,j ti,t2 

□ 

Next, we recall the one-sided Chebychev inequality. 

Fact 3.6 (One-sided Chebychev Inequality). For a random variable X with mean /j. and variance cr^ and 
any t > 0, 

P[X<M-ia]< 

Properties of Gaussian Variables The next few facts are folklore about Gaussians. Let ti/i- denote 
the cap of a standard normal variable, i.e., ti/k € M is the number such that for a standard normal 

random variable X,V[X^ ^i/fe] = 

Fact 3.7. For a standard normal random variable X and for every t > 0, 



ti/k « V 2 log/? - log log fc 

Fact 3.8. Let Xi,X2, . . . ,Xk be k independent standard normal random variables. Let Y be the random 
variable defined as Y =^max{Xj|i S [k]}. Then 

1. ti/k^E[Y]^2y/I^ 

2. E <^ 41ogfc 

3. E [r^] <^ 4elog^fc 

4. For any constant e, P [F ^ (1 — ^ 

Proof. For any Zi,. . .,Z}. e M+ and any p e Z+, we have maxj Zi ^ (^^ Z^)p . Now = (maxj XiY ^ 
maxiX/. 



E ^ E 



ip 



^ E 



( Jensen's Inequality ) 
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Picking p ~ log k gives E [y*] ^ 4elog^fc. 

Therefore E \Y^] < ^E]^ < 41og fc and E [Y] ^ ^JW\r^ 2^/Iogfc 
And, 

A; 



1 



1 



□ 



Fact 3.9. ietXi,...,Xfe anc? Yi, . . . , Yfc be i.i.d. standard normal random variables such that for alii [k], 
the covariance of Xi and Yi is 1 — . Then 

P[argmax,Xi 7^ argmaXj^i] ci (e0ogfc 



/or some absolute constant Ci. 

We refer the reader to |CMM06] for the proof of a more general claim. 

3.3 Main Proof 

Let / denote the vector hi. The choice of the index 1 is arbitrary and the same analysis is applicable to all 
other indices i S [m] . We first separately bound the expectations of the numerator and denominator of the 
sparsity of each cut, and then the variance of the denominator. The proofs of these bounds will follow their 
application in the proof of our main theorem. 

Expectation of the Denominator Bounding the expectation of the denominator is a straightforward 
calculation as shown below. 



Lemma 3.10 (Restatement of Lemma 3.2 1 



2 log fc < E 



< 4 log A:. 



Proof of Lemma 3. 2 For any i e [n] , recall that 



\\ui\\{ur,gi) if > {ui.gj) yj e [k] 

otherwise. 



The first case happens with probability 1/fc and so fi — with the remaining probability. Therefore, using 
Fact|3]8l 

2|lu,||2logfc/fcs^E[/2] ^i\\u,\\Hogk/k 

and hence 



2 log fc sC E 



^ 4 log fc 



using "^ill^ilP — ^ from Lemma 



3.5 



□ 



Expectation of the Numerator For bounding the expecation of the numerator we will need some 
preparation. We will make use of the following proposition which relates distance between two vectors to 
the distance between the unit vectors in the corresponding directions. 

Proposition 3.11. For any two non zero vectors Ui and Uj, if Ui — Ui/\\ui\\ and Uj — Uj/\\uj\\ then 

kP + \\u,P € 2||u,: - 
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Proof. Note that 2|lui|||lMj|l |litj|p + Hence, 

\\u.-Ujri\M' + \\u,r) ^ (2-2(u~,u,))(||«,|p + ||u,||2) 



If {ui,Uj) ^ 0, then 
Else if {uijUj) < 0, then 



□ 



We wiU also make use of the following propositions which bounds the expected value of a conditioned 
random variable. 

Proposition 3.12. For indices i ^ j 

E[(m„5i)2|/, >0]P[/, >0] ^ ^\\u,f\ogk 

Proof. 



E[(u„5i)2|/,. >0]P[/, >0] E 



max('Ui,5p)^|(uj,.gi) > {u.j,gi)yi G [k] 



96 [fc] 



pe[fc] 



[u,,gp)'^\{uj,gq) > {uj,gi)yi G 



E 



max(ui,gp)^ 

P6 [fc] 



|u^||logA: 



Similiarly, we also prove the following proposition. 
Proposition 3.13. For indices i ^ j 



□ 



E [{u^,gl)V^ > and /, = O] P [/, = 0] ^ 4 1 - - \\u,\\^\ogk 



k 

We will need another Lemma that is a direct consequence of Fact |3.9| about the maximum of k i.i.d 
normal random variables. 



Proposition 3.14. For any i,j S [n], 



P [/, > and J J = 0] < ci 



i/Iogfc 
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We are now ready to bound the expectation of the numerator, we restate the Lemma for the convenience 
of the reader. 



Lemma 3.15. (Restatement of Lemma 3.3) 



E 



Eli?-/ 



< 8(2ci + l)(yA^log5 k). 



Proof of Lemma\3l\ From Fact 3.8 E[ff\f,>0] 4\\u,f logk. Therefore, 

>0] = E[\{u„g,f-{u„gif\\f,,fj>0] 

= E[\{ui +Uj,gi)\\fi,fj > 0] 

^E[{u,~u„giy\f,,f, > 0]^E[{u, + u„g,y\f,J, > 0] 



Now, 



E [(u, - /,-> 0] P [/„/,> 0] 



{ui - Uj,gi)'^ 



< / {u, - Uj,gif - E [(u, - u,,gi)^\f, > O] P [/, > 0] 

Jfi>0 



SimiUarly, we get 



max(ui -Uj,gi) \{ui,gp) > {ui,gi)yi G [k] 



{ui,gp) > {u.„gi)yi e [k]] 



E 



max(Mi - Uj.giY 



^ -\\ui ~ Ujiplogfc 



E[K + u„gi)V«,/j >0]P[/^,/, >0] sC -\\u.,+u,f\ogk 

Therefore, we get 

E[|/f-/J||/.,/, >0]P[/„/, >0] ^ ^\\u,-uMu^ + UJ\\\ogk 
From Proposition |3.14[ 



[/, > and fj = 0] = P [fj > and = 0] ci{\\u, - Uj\\^/I^/k). 



Therefore, 



E [/f - f!\^^ > 0, /, = 0] P [/,; > 0, /, = 0] - E [{u,,g)'\f, > 0, /, - O] P [/, = 0] 



P [f^ > 0, f, = 0] 

= 0] 



^ 4(1- j-)||u,||Mogfc '^^'^'^°'{^~"^ ( Proposition [Slsl) 
k 1 — 1/ k ' ' 

4ci , a , II ii2ii ~ ~ II 



Similarly, 



E [f] - fl\fj > 0, /, = 0] P [/, > 0, f, = 0] < ^ log5 k\\u,f\\u, - u,\\.J\ogk 
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Next, 



E 



■2 I 

j 



41ogfc [ Y^ii 

k 

41ogfc 



- + \\u,j\\) + 2ci||Wi - + Vlogfc 



( from Lemma |3.11[ ) 
^(2ci + l)logj fc ^„ 



^ — ~^ ^ ^jH)^ + ll'^jll)^ (Cauchy-Schwartz inequality) 



4(2ci - 


f 1) log 


k 




k 




4(2ci - 


f 1) log 






k 




4(2ci - 


f 1) log 


3 , 

^2 k 




k 




8(2ci - 


f 1) log 


3 , 

■2 A; 



■J 1 1 1 1 ^ 



8(2ci + l)log5 fcVA 



□ 



Variance of the Denominator Here too we will need some groundwork. 

Let Q denote the Gaussian space. The Hermite polynomials {-ffi}igz>o form an orthonormal basis for 
real valued functions over the Gaussian space Q, i.e., ¥.g(zg[Hi{g)Hj{g)] ~ 1 \i i — j and otherwise. The 
fc-wise tensor product of the Hermite basis forms an orthonormal basis for functions over . Specifically, 
for each a e Z>q define the polynomial Ha as 

k 

Ha{xi, ...,Xk)^ 'Y\_Ha,{Xi). 

i=l 

The functions {^^alagz^ form an orthonormal basis for functions over Q'^. The degree of the polynomial 

Ha{x) denote by \a\ is |a| = a^. 

The Hermite polynomials form a complete eigenbasis for the noise operator on the Gaussian space 
(Ornstein-Uhlenbeck operator). In particular, they are known to satisfy the following property (see e.g. 
the book of Ledoux and Talagrand [LT91 , Section 3.2). 

Fact 3.16. Let {gi, hi)^^^ he k independent samples from two p-correlated Gaussians, i.e., &[g1] — E[/i|] — \, 
and E[(7i/ii] — p. Then for all a G Z^q, 

E[Ha{gi, ■ . ■ , gk)Ha'{hi, . . . , hk)] — p'"' if a = a' and otherwise 
Let A : — > E be the function defined as follows. 



A{x) 



x\ if (xi > Xiii € [k]) V {xi ^ xi^i £ [fc]) 
otherwise 
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We know that 



ElA] ^ ^ and E[A'] ^ (Fact[3l 



k 



Lemma 3.17. Let u,v be unit vectors and gi, . . . ,gk be i.i.d Gaussian vectors. Then, 



E[A{{u, gi), . . . , (w, <7fc))A((v, gi), . . . , {v, gk))] < 



16 log^ k 



Proof. The function A on the Gaussian space can be written in the Hermite expansion A{x) — "Ylia ^aHaix) 
such that 



E,Q „r .9, 16 log k 
Al=E[A']^ 



Using Fact |3.16[ we can write 



E[Ai{u,g,),...,{u,gk))Ai{v,g,),...,{v,gk))]^mA]f+ ^ 

where p = (m, v). Since A is an even function, only the even degree coefficients are non-zero, i.e., Aa = for 
all \a\ odd. Along with p < 1, this implies that 



E[Ai{u,gi),...,{u,gk))Ai{v,g,),...,{v,gk))]^{E[A]f ^ Al] where p={u,v) 



< 



161og^fc 2l61og^fc 



A;2 



k 



Now, we will show a bound on the variance of the denominator. 
Proof of Lemma \3.4\ 



E 



= ^dd,\W^f\\^jfE 



^ '^d^dJ\\u^f\\ujfE[A{{u,,gl),. . . , {u,, gk))A{{uj, gi) , . . . , {uj,gk)))] 



jemma 



3.17) 



161og^ k I -s-^ 2 1 , II ii2\ 
1 \2^d^dj{ui,Uj) + ^(2^d,||u^|| ) 



< 32 log^ k 

Therefore Var djf] = E fe, . d,d,ffff] - (E djf] ) 28 log^ k. 



□ 



□ 
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Putting It Together 



Proof of Theorem 1.3 Now, for each I g [fc], from Lemma 3.2 we get that E [X^i "^i^K*)^] ~ 6(logfc) and 
we get that Var \J2ii ^K*)^] = ©(log^ k). Therefore, from the One-sided Chebyshev inequal- 



from Lemma 



ity (Fact 3.6 1, we get 



3.4 



2 



where c' is some absolute constant. 



Therefore, with constant probability, for Q.{k) indices I £ [fc], dihi{i)'^ ^ 
with probability 1 - c'/2 , E.., Mif - hi{j)^\ 2/c'E [e,^, - /iiOT 

probability, for a constant fraction of the indices / G [fc] , we have 



EJ2^d,hiii 



Also, for each I, 
Therefore, with constant 



E 



Oi^Xk log A:) 



Applying Lemma 3.1 on the vectors with those indices will give fl{k) disjoint sets Si, ... , Sck such that 



This completes the proof of Theorem |1.3 



□ 



4 Lower bound for k Sparse Cuts 

In this Section, we prove Proposition |1.2[ 



Proposition 4.1 (Restatement of Proposition 1.2). For any edge-weighted graph G — {V,E), for any integer 
1 ^ A: ^ and for any k disjoint subsets Si, . . . , Sk C V 



max0G('S'j) ^ 



Xk 



where Ai, . . . , A|y| are the eigenvalues of the normalized Laplacian of G. 

Proof. Let a denote max^ (j^ciSi) Let T =^ V\{yJiSi). Let G" be the graph obtained by shrinking each piece 
in the partition {T,Si\i G [k]} of F to a single vertex. We denote the vertex corresponding to Si by Si Mi 
and the vertex corresponding to T by t. Let C be the normalized Laplacian matrix corresponding to G' . 
Note that, by construction, the expansion of every set in G" not containing t is at most a. 

Let U =^ {D^Xsi \i £ [m]}. Here Xs is the incidence vector of the set 5 C y. Since all the vectors in U 
are orthogonal to each other, we have 



Xk{C) 



x^Cx 



x'^ Cx 



S:rank{S) = k xeS x'^ X ^ xespan{U) x'^ X i;gR'=*{0} 



For any a; G ]R, let a;+ max{a;,0} and x max{— a;,0}. Then it is easily verified that for any 
y,,y, G E, (y, - y^f ^ 2((y+ - y+f + {y- - yjY). Therefore, 



E,; Ej>,; <o iy^ ~ yi? ^ '^iY.^ Ej>» <■ (vt - yt)^ + E,; Ej>^ (yj -y^ ?) 



^ 2(E.E,>,<l(y.r)' - {yJf\ + T..i:,>MMf - {yiY)\ 
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Without loss of generality, we may assume that j/J*" > > ■ • • ^ 2/^ ^ yt = 0. Let Ti = {si, . . . , s,} for 
each i G [k]. Therefore, we have 

k k k 

i j>i i—1 i—1 i 

Here we are using the fact that w'{E{Ti, Ti)) ^ aw'{Ti) which follows from the definition of a and that 
w'iT,+i) ~w'{Ti) = W'+i- 

Similiarly, we get that T,iT,j>iW[j\{y~)^ - (y,")^)| ^ aJ2^w',{y^)^ ■ 

Putting these two inequalities togethor we get that J2j>i ^'ijiVi — Uj)^ ^ 2a w-yf . 

Therefore, Afe(£) < 2ina,Xi (j)G{Si). 

□ 



5 /c-partition 

In this Section, we give a constructive proof of Theorem 1 1 . 7| The following construction shows that if restrict 
the fc-sets 5*1, . . . , Sk to form a partition of V, then max^ (pdSi) ^ f2(-v/Afepolylog/c). 




Figure 2: fc-partition can have sparsity much larger than rj(v'Afcpolylogfc) 



Lemma 5.1. For the graph G in Figure^ and for any k-partition Si, . . . , Sk of its vertex set, 

max, (j)GiS^) ^ Q^^2 IJ-^ 

Proof. In Figure [2] V« G [k], Si is a clique of size (n — l)/k (pick n so that k\{n— 1)). There is an edge from 
central vertex v to every other vertex of weight pn. Here p is some absolute constant. 

Let V =^ {5*1 U {v}, 5*2, S3, . . . , Sk}- For n > k^ , it is easily verified that the optimum fc-partition is 
isomorphic to T''. 

For k < 0(71,3), we have 

'pk^' 



max ^aiS.) = 0g(^i U {.}) = + pnk 



e 
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Applying Proposition 1.2 to Si, . . . , Sk, we get that Afc — 0{pk'^ /n). Thus we have the Lemma. 

□ 
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A Recursive Algorithms for graph mult i- partitioning 

The following construction (Figure |3| shows that partition of V obtained using the recursive algorithm of 
[LRTVll] can give as many as fc(l — o(l)) sets have expansion while < 0{k'^ /n^). 



SI 



52 



S3 



SP 




Figure 3: Recursive algorithm can give many sets with very small expansion 

dcf 

In this graph, there are p — k'^ sets S'i for 1 ^ i ^ fc^. We will fix the value of e later. Each of the Si has 
k^^^ cliques {Sij : 1 ^ j ^ k^^^} of size n/k which are sparsely connected to each other. The total weight 
of the edges from Sij to Si\Sij is equal to a constant c. In addition to this, there are also k — k'^ vertices 
Vi : 1 ^ i ^ k — k^ . The weight of edges from Si to Vj is equal to k^"^ . 



Claim A.l. 



1. (^(5,,) sC {c+l)kyn^ Vz,j 



2. (t>is,)^i/{c + i)^{s,j)yi,j 

3. \k = 0(w?ln^) 



Proof. 1 . 



(m— m^)m 



0(5., 



{ra—m^^m" 



(c + \)rr? 
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2. 'w{Si) = J2j but for each Sij only l/(c + 1) fraction of edges incident at Sij are also incident 

at S,. Therefore, <j){S,) < l/(c+ 



3. Follows from (1) and Proposition 1.2 



□ 

For appropriate values of e and fc, the partition output by the recursive algorithm will be {Si : i £ 
[fc^]} U : i e [fc — k'^]}- Hence, fc(l — o(l)) sets have expansion equal to 1. 
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